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l 

cgactggagcacgaggacactgacatggactgaaggagtagaaaggtttaaaaaacccagtttgagaa ATG TCT AAG TCT GGG CTT CAT CTT 

MSKSGLHL 

93 

GTA GCC TAC ATA TTA TTG ATA TTT TTA ATT TCA ACT AAT ATA GCA TCT AAA ATT TCT GGT GTT CCA TTA TGC AAC 

VAYILLIFLISTNIASKISGVPLCN 

168 

AAA GAT ACT TCA CCA GTA TTT ACA CTT CAA CAT AAT TCT ACT AAT GGT ATT TTA GCT AGA TCT CTT CCA CAA CCA 
KDTS PVFTLQHNSTNGILARSLPQP 
243 

GGA TTA ATT GAT TGT TCA GAA CAT TGT TCC TCT TCG TCA GAT TGT ATT GGC GTT GAA TAT TGG CAG GGA ATT TGT 
GLIDCSEHCS .SSSDCIGVEYWQGIC 
318 

AGA GTT ATT TCT CAA GAT AAA ACT TCT ATT TAT ACA CCA ACA GAT GAA ACT TCA ATA CTT TTA ACA AAA TCA TGT 
RVI SQDKTSIYTPTDETSILLTKSC 
393 

GTT AAA AGT GAT CGT ATA TGT TCA TCA CCA TTC CAT TTT GAT GTT TAT GAA CAA AAA ATA TTA GTT GGA TTT GCT 

VKSDRICSSPFHFDVYEQKILVGFA 

468 

AGA GAA GTT GTA CCA GCT GAG TCT ATT GAA ATT TGT ATG GCT GCT TGT TTG AAT GCT TTT GAT ACA TAT GGT TTT 
REVVPAES I EI CMAACLNAFDTYGF 
543 

GAA TGT GAA TCA GCT ATG TAT TAT CCA GTT GAT AGT GAA TGT ATT CTT AAT ACT GAA GAT AGA CTT GAT CGA CCA 
ECESAMYYPVDSECI LNTEDRLDRP 
618 

GAT CTT TTT GTT GTT GAA AAA GAA GAT GTT GTT TAT TAT CTT GAT TCT AAT TGT GCT GGT TCA CAA TGT TAT GCT 

DLFVVEKEDVVYYLDSNCAGSQCYA 

693 

CCA TAC ATT ACA CAA TAT ATT GCT GTT GAA AAT AAA CAA ATA GAA AAT GAA TTA GAT AGA AAA TTT GAA AAT ATT 

PYITQYIAVENKQIENELDRKFENI 

768 

GAT TTC CAA ACA TGT GAA GAA TTA TGT ACT GGT AGA ATT ACT GTT ACA CAA AAT GAT TTT ACT TGT AAA TCA TTT 

DFQTCEELCTGRITVTQNDFTCKSF 

843 

ATG TAT AAT CCT GAA ACA AAA GTT TGT TAT CTT TCT GAT GAA CGT TCA AAG CCT CTT GGA CGG GCT AAA TTA AGT 

MYNPETKVCYLSDERSKPLGRAKLS 

918 

GAT GCT AAT GGA TTT ACT TAT TAT GAA AAA AAA TGT TTT GCA TCT CCA AGA ACA TGC CGT CAA ACA CCA TCA TTT 
DANG FTYYEKKCFASPRTCRQTPS F 
993 

AAT AGA GTA CCA CAA ATG ATT CTT GTT GGT TTT GCT GCA TTT GTT ATG GAA AAT GTA CCA TCT GTT ACT ATG TGC 
NRVPQMI LVGFAAFVMENVPSVTMC 
1068 

CTT GAT CAA TGT ACA AAT CCA CCA CCA GAG ACA GGT GAA AAA TTT GTC TGT AAA TCT GTT ATG TAC TAT TAT AAT 

LDQCTNPPPETGEKFVCKSVMYYYN 

1143 

GAA CAA GAA TGT ATT CTT AAT GCT GAA ACA AGA CAT ACA AAG CCA GAT CTT TTT ATT ACA GAA GGA GAT GAA TTT 

EQECILNAETRHTKPDLFITEGDEF 

1218 

CTT GTT GAT TAT TTT GAT ATT TCA TGT CAT CTT GAA CCA GAA ACA TGT CCT AAA GGA ACA TAT TTA AAA GGA ATT 
LVDYFD ISCHLEPETCPKGTYLKGI 
1293 

AAA TCT ATC AAT TCT GCA CTT CCT GAG GGT GAA GGC TCA CTT CAT GTT ATT GAG TCT GCT GGA AAA TCA TTA GAA 

KSINSALPEGEGSLHVIESAGKSLE 

1368 

GAA TGT ATG GAA AAA TGT AAC CAA CTT CAT CCA GAA AAA TGT AGA TCA TTT AAT TTT GAA AAA TCA TCT GGA TTA 

ECMEKCNQLHPEKCRSFNFEKSSGL 

1443 

TGT AAT CTT TTA TAT CTT GAT GGA AAA AAT ACT TTA AAA CCA TTT ATT AAA AAT GGA TTT GAT CTT GTT GAT TTA 
CNLLYLDGKNTLKPFI KNGFDLVDL 
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1518 

CAA TGT TTA TCA ACT AAA AAA GAT TGC TCT ACA AAA AAG AAT GAT ATT AAT TTT GTT AAA TAT CTT TAC TCT CAT 
QCLS TKKDCSTKKNDINFVKYLYSH 
1593 

TTT GTT AAA TAT CTT TAC TCT CAA CAA CCT GGA ATT CCA ACA AAA ACA GAA AAA GTT ATT GGT ATT TCT AAA TGT 

FVKYLYSQQPGIPTKTEKVIGISKC 

1668 

CTT GAT TTA TGT ACT GAT AGT GAA CGT TGT GAA GGA CTT AAT TAT AAT AGA AGA ACT GGA GAA TGT CAA TTA TTT 

LDLCTDSERCEGLNYNRRTGECQLF 

1743 

GAA ATT ATT GAT GGA CCT TCT AAT CTT AAA AAA TCT GAG CAT ATA GAT TTT TAT CAA AAT CTT TGT TCT ACT AAA 

EIIDGPSNLKKSEHIDFYQNLCSTK 

1818 

GAA AAT GAA GCT GGT GTT TCA TCT GCA TTA AAT GTA CCA CAA TCA TCT GTT ATT CCT ATT TCA TCA TCA CAA AAT 

ENEAGVSSALNVPQSSVIPISSSQN 

1893 

ATT AGT AAA AGT GAT GTT TTT GCC AAA AAA AAT CTT AAT AAA GAT GGT AAT AAT CAA GTA AAC ATT TAT GAA CCA 

ISKSDVFAKKNLNKDGNNQVNIYEP 

1968 

GAA AAA AAA TAC CAT CCA AAA GGA TCA AAA AAT GAA ACA TCA TAT GAA ACA GGA ACT GTA AAT AAA TCA AAT GTT 

EKKYHPKGSKNETSYETGTVNKSNV 

2043 

GAA GAG GTT TCT GAA ACT TTA ACT AAT AGT GGA GTT GAA AGT GGA AGT CTT GAA AAA AAT ATT ATT ACA GCA CCA 
EEVSETLTNSGVESGSLEKNI ITAP 
2118 

CCA TCT ATA CCA AAA ATT CCT GAA GGT CCA CTA CCA GTG CCA ATT TTA ATT CCA GCT GAT CAA GTA CAA ACT ATT 

PSIPKIPEGPLPVPILIPADQVQTI 

2193 

TGT GAT TAT GAA GGT ATT AAA GTA CAA ATT AAA TCA CCA CAA TCA TTT ACT GGT GTT ATC TTT GTT AAA AAT CAC 
CDYEG I KVQIKSPQ SFTGVI FVKNH 
2268 

TAT GAA ACA TGT CGT GTT GAA GTT TCC AAC TCT GAT GCA GCT ACT CTT GAG CTT GGT CTT CCA GCT TCA TTT GGA 

YETCRVEVSNSDAATLELGLPASFG 

2343 

ATG AAA CCA GTT ACA CTG TCT GCT ACA TCT TCA GAT TCT ACC TCT TCA CAG AAT ATT ACT TCT AAT AGT GGA CAT 

MKPVTLSATSSDSTSSQNITSNSGH 

2418 

AAA GTT GTT GGA AGA GCA CGC CGT GAT ACA CAA GAA AAA TCT TGT GGT CTT ACA GAA ATT GAA AAT GGA AAA TAT 
KVVGRARRDTQEKSCGLTEI ENGKY 
2493 

AAA AGT ACT GTT GTT ATA CAA ACA AAT AAC CTT GGA ATT CCT GGA CTT GTA ACA TCA ACA GAT CAA ATT TAT GAA 
KSTVVIQTNNLGI PGLVTSTDQIYE 
2568 

ATT GGT TGT GAT TAT AGT AGT ATG TTA GGA GGA AAA ATT ACT ACA GCA GCT AAT ATG ACT GTA AAT GGA CCA ACA 

IGCDYSSMLGGKITTAANMTVNGPT 

2643 

CCA ACT GAT ATT AAA CCT AGA GGT AAA ATT GAA CTT GGA AAT CCT GTT CTT ATG CAA ATG AAT GCT GGT ACA GGT 
PTDI KPRGKI ELGNPVLMQMNAGTG 
2718 

GAT CAT CAG CCA ATT TTA CAA GCT AAA CTT GGA GAT ATT CTT GAA TTA AGA TGG GAA ATT ATG GCT ATG GAT GAA 

DHQPILQAKLGDILELRWEIMAMDE 

2793 

GAA CTT GAT TTC TTT GTT AAA GAT TGT CAT GCA GAA CCT GGT ACT GGT GCT GGA GGA GAT GAA AAA CTT CAG CTT 

ELDFFVKDCHAEPGTGAGGDEKLQL 

2868 

ATT GAA GGT GGA TGC CCA ACA CCA GCT GTT GCT CAA AAA CTT ATT CCA CAA CCA ATA AAA TTA CAA TCA TCA GCT 
IEGGCPTPAVAQKLI PQPIKLQSSA 
2943 

GTC AAA ATT GCC CAT CTT CAA GCT TTC CGT TTT GAT TCA TCC TCT TCA GTT AGA ATA ACA TGT AAT ATT GAA ATT 
VKIAHLQAFRFDSSSSVRITCNI EI 
3018 

TGT AAG GGA GAT TGT AAA CCA GCA ACA TGT GAT ATG CAC GGA GAA TCA AAA CAA TCA TGG GGA AGA AAA AAG AGA 
CKGDCKPATCDMHGE SKQSWGRKKR 
3093 

CAT ATT GAA GAT GAT ACA ATT ACA GAA TTT GAG ACA AAT CGT TAT AAA GTT CCA AGA TTT TCA CAA GCA ACA ACA 
HIEDDTITEFETNRYKVP RFSQATT 
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3168 

TCT CTT TTA ATT CTT GAT CCA CTT CAA AAT AAC ATT GAA CCA GCA TCA TTA ATG TCA AAA GTA TCA TCT CTT GAT 

SLLILDPLQNNIEPASLMSKVSSLD 

3243 

TTG TTA GCT GAA GAT CCT GCA AAA ACA TTA CTT AAG ATT AAA GAG ACT GCA CAT TTG AAT GGA AAT CTT TGT ATG 
LLAED PAKTL LKIKETAHLNG NLCM 

3318 

GGA AAA ATT ACA CTT TTC TCA GTA TTT GGT GTT CTT CTT TCA TTA ATT GTT GTT CAA GCA ATT GTC GTA ACA AAT 
G KI T L F S V FGVLLS L I VVQA I VVTN 
3393 

TAT ATT TTT AAA AGA GTT ATG TCA AGC AGA AAG ATT ACC AAT TAAactttaataattaaacaataattataaatatgcctttatgt 

YIFKRVMS SRKITN 

3479 

tctcaaaacgagtataatccttttttttgttattaattttagtatcaaaatatatatacccgatggcatttacaataataataaatacaactgaagaaag 
3579 

ctataatatgaaaccgtgccagaaacttattcaaagtttttaatctctctctctctctttctaatttcctttcaaaacattccattttttttttttgttt 
3679 

ttatttaatcaaaaaataataattaaatagtaatttatgatatatcattaatatttttataatatttttttg 
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l 

ggcacgagaa ATG AAC TGG CTA TCT ATA GCT TCA ATT TGT ACA TTC TTA ATT ATA CCA ATA TCT GCT GTC TTT GAA 
MNWLSIASICTFLIIPISAVFE 

77 

TGT TCA GGA TCA GAA ACT ACA GCA TTT ATT AGA ATA TCC AGA GCA CGC CTT GAT GGG ACA CCA GTA GTT ATT TCT 

CSGSETTAFIRISRARLDGTPVVIS 

152 

ACA GCA GGA CAT GAC TTG ACT TGT GCA CAA TAT TGT AGA AAT AAT ATT GAA CCA ACA ACT GGT GCT CAA CGT GTC 
TAGHDLTCAQYCRNNI EPTTGAQRV 
227 

TGT GCA TCA TTT AAT TTT GAT GGT CGT GAA ACA TGC TAC TTT TTT GAT GAT GCT GCC TCA CCT GCT GGG ACT GGG 

CASFNFDGRETCYFFDDAASPAGTG 

302 

GAG TTG AAT GAA GCA CCA TCA GCT AAT AAT TTT TAT TAT GAA AAA GTT TGC CTT CCA GCT ATC TCT GCT CAT GAA 
ELNEAPSANNFYYEKVCLPAI SAHE 
377 

GCA TGT ACT TAT AGA TCA TTT TCA TTT GAA AGA ACT AGA AAT ACT CAA TTA GAA GGT TTT GTT AAA AAA TCA CTA 

ACTYRSFSFERTRNTQLEGFVKKSL 

452 

CAA GTT ACA TCA CGT GAA GAA TGC CTT TCT ACA TGT TTA AAA GAA AGT GAA TTT GTA TGT AGA TCA GTT AAC TAT 

QVTSREECLSTCLKESEFVCRSVNY 

527 

AAT TAT GAA AAC TTT ATG TGT GAA CTT TCA ACA GAA AGA TCG CGT TCT AAA CCA CAA AAT ATG AGA ATG TCA GCA 

NYENFMCELSTERSRSKPQNMRMSA 

602 

GCT CCA GTT GAT TAT TAT GAT AAT AAT TGT TTA AAT AGA CAA AAT AGA TGT GGT GAA TCT GGT GGA AAT TTG ATT 

APVDYYDNNCLNRQNRCGESGGNLI 

677 

TTT ATT AAA ACA ACA CAA TTT GAA ATT CAT TAT TAT GAT CAT ACT CAA TCA ATG GAA GCA CAA GAA TCA TTC TGT 

FIKTTQFEIHYYDHTQSMEAQESFC 

752 

TTA CAA AAA TGT TTA GAT TCA TTA AAC ACC TTC TGT AGA TCT GTT GAA TAT TCT CCA TCT GAA AAA AAT TGT ATT 

LQKCLDSLNTFCRSVEYSPSEKNCI 

827 

GTT TCT GAT GAA GAT ACA TAT TCA AGA GCT GAT CAA CAA GGT GAA GTT AAT AAT AAA GAT TAT TAT GAA CCT GTT 

VSDEDTYSRADQQGEVNNKDYYEPV 

902 

TGT GTT GCT GCT GAT CTT AGT TCA TCT ACA TGT CGT CAA CAA GCT GCT TTT GAA AGA TTT ATT GGT TCT GCT ATT 

CVAADLSSSTCRQQAAFERFIGSAI 

977 

GAA GGT ACC CCA GTT GCT ACA GCA CAA CAA GTA ACC ATT TCT GAT TGT ATT TCA CTT TGT TTC CAA AAT TTG AAT 
EGTPVATAQQVTI SDCI SLCFQNLN 
1052 

TGT AAA TCA ATT AAT TAT GAT CGT ACA CAA TCT ACA TGT TAT ATT TAT GCT GTT GGA AGA CAA GAA TCT AAT GTT 

CKSINYDRTQSTCYIYAVGRQESNV 

1127 

AAA AAT GAT GCA AGT TTC GAT TAT TAT GAA TTT ACA ATT ATT GAT AAT GGA TGC CCA AGA TAT CCT GCT CTT GTA 
KNDASFDYYEFTI IDNGCPRYPALV 
1202 

GGG CCA GTT TTA CAA GAT TTC GAC AAA AAT CGT CTT AAA TCT GAA ATG AAA GCA TTC CGT TTA GAT GGA TCA TAT 

GPVLQDFDKNRLKSEMKAFRLDGSY 

1277 

GAT ATT CAA ATT GAA TGT TCT GTT ATG TTT TGT GCT GGT CCA ATG GGT TGT CCA CCA TCT AAT TGC CTT GAT TCA 

DIQIECSVMFCAGPMGCPPSNCLDS 

1352 

GGA ACA AAT GAA TTA TTT GCT TCA CAT GGA AGA AAG AAA AGA AGT ATT GTT GAT TTC AAA AAT ACA ACA ACA TCT 

GTNELFASHGRKKRSIVDFKNTTTS 

1427 

GCA GAA ACA TTA TCT GCT ATA ATT AGA GTA CTT GCT GCT GGA GAA GAA GAA TTA GAA GTT GAA GAA TTT TAT AGA 
AETLSAI IRVLAAGEEELEVEEFYR 
1502 

AAT GAT ACT AAT TTT AAA TAT GAT TCT GAA GAA AAT ATC TCA GCT CAT AAC TTA TAC TGT ATG TCT GAA ATG TGG 
NDTNFKYDSEENI SAHNLYCMSEMW 
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1577 



TTT GTA TCA GGA ATT GTT TCA ATG GCT ATG ATC TGT CTT CTT CTT TCT GTT CTT ATA GTT ATG TGG GGC TGT CAT 

FVSGIVSMAMICLLLSVLIVMWGCH 

1652 

TCA TTA AAT CAA TCT TCA AAA TTA CCA ATG TGAaggaagatctttcaacaaaaaaaaacgattaatttttaatatttctttaatatatac 

SLNQSSKLPM 

1742 

attccataatcagtatatactataataattgcaacataataatttattgtagaagtctgtttataaaatcaaatcacaaatttttcttttacagtactgt 
1842 

gcacaacaacaagaaattccaatctcttcctatattttgatgtcgtacacacgtttataaaaacaaattcttttggtttttaatcagttttcagtttaca 
1942 

tttatataat 
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l 

gcgtcgatagtccgattttttaaggtttaattacccaagcttaaggaatatttgaagcttatttttaaagaaaaaataaattaaataagagattagcaca 
101 

acaacaacagaaatttttcttgaatttacaacaaaataattttttcttaattaaattcctttaaattatccacaacttct ATG GTT ACA AAA ATC 

M V T K I 

196 

CCA ACT TTT CCC CTC CTT TTT ATT TTC CCA TTT TTA TTT ACA TTT TTA ACG ACA AAA TGT CAG GCT TAT TCT ATA 

PTFPLLFIFPFLFTFLTTKCQAYSI 

271 

CCA TTA ATA TCA GAA TGT AAT TCG GAA GAA GCC CCA GTT TTT CTT TTG CAA CGG AAT GTT TCT TCT ATC GCC GGA 

PLISECNSEEAPVFLLQRNVSSIAG 

346 

ACT GAG CCT TTA AGA ACT GTT CCT GTT ACA GGG GGA TTT TTG GAA TGT GCG GAA CTT TGT TCA GCA GCA AAT AAT 

TEPLRTVPVTGGFLECAELCSAANN 

421 

TGT GTT GCT GTT AAA TTT TCT ATT GAA AAA CAA TGC CAA TTG TTG GGG AAA ACA ACT ATG ACA GCA ACA ACT TTA 

CVAVKFSIEKQCQLLGKTTMTATTL 

496 

TCT TTA CAA GAC ATT AAT TTG ACA CTA GCT AGA TTA GCT ACT AAA AGT TGT GTT AAG AGC AAA AAA ATC TGT TCT 
S LQDINLTLARLATKSCVKS KKI CS 
571 

TCC CCC TTC CAT TTT GAT GTT CAC GAA CAA AAA ATA CTT GTT GGT TTT GCT AGA GAA GTT GTA TCA GCA GAA TCT 

SPFHFDVHEQKILVGFAREVVSAES 

646 

ATA CAT CAA TGT TTA ACT GCT TGT TTA GAT GCT GTT GAT ACT TTT GGC TTT GAA TGC GAG TCA GTA ATG TAT TAT 
IHQCLTACLDAVDTFGFECE SVMYY 
721 

CCA TTG GAT GCC GAA TGT ATT TTA AAT ACA GAA GAC AGA CTT GAC CGT CCA GAT TTG TTT GTT GAT GAG AAG GAA 
P LDAE C I LNTEDRLDR PD L * F V D E KE 
796 

GAT ACT GTT GTT TAT TTG GAT AAT AAT TGT GCT GGA TCC CAA TGT CAT GCC CCT TAT GTA ACC CAA TAT GTA GCT 

DTVVYLDNNCAGSQCHAPYVTQYVA 

871 

GTT GAA GGA AAA CAA TTA GCT GAG GAA TTG GAT CAT AAT TTT GAG GGA ATG GAG TTG ACA GAA TGT GAA CAG CTT 

VEGKQLAEELDHNFEGMELTECEQL 

946 

TGT AAT CAA AGA TTG AGT GTT TCT GCA AAT GAC TTT AAT TGC AAA GCA TTT ATG TAC AAT AAC CAA ACA AGA TCT 

CNQRLSVSANDFNCKAFMYNNQTRS 

1021 

TGT ATT CTT TCT GAT GAA CGT TCA AGA CCT TTG GGT AGA GCT AAT TTG ACA GAT GCT AAA GGA TGG ACT TAT CAC 

CILSDERSRPLGRANLTDAKGWTYH 

1096 

GAG AAA AAA TGT TTT GCC TCC CCA CGT ACA TGC CGA AAT GTT CCT TCT TTT ACC CGC GTC CCT CAA ATG TTA TTA 

EKKCFASPRTCRNVPSFTRVPQMLL 

1171 

GTT GGA TTT GCC TCT TTT GTA ATG GAA AAT GTC CCT TCA GTA ACT ATG TGT TTG GAT CAA TGT ACA AAT CCT CCC 

VGFASFVMENVPSVTMCLDQCTNPP 

1246 

CCA GAA ACT GGA CAA AGT TTT GTT TGT AAA TCT GTC ATG TAT TAT TAT AAT GAG CAA GAA TGT ATT TTA AAT GCT 
PETGQSFVCKSVMYYYNEQECI LNA 
1321 

GAA TCA CGT CAT TCC AAG CCA GAT TTA TTT ATT CCC GAA GAA GAC GAT TTT GTT GTA GAT TAT TTT GAT ATA AAT 
ESRHSKPDLFI PEEDDFVVDYFDIN 
1396 

TGC CGT CTA GAA CAA GAA CAA TGT ATC GAT GGA AGA ACG CCC CAA TTA GTT AGA ACA ATT AAT TCT GCA CTT CCA 

CRLEQEQCIDGRTPQLVRTINSALP 

1471 

GAA GGG GAG GGG TCT ATA CAT GTT TTG GAA ACA ATT AAG GGA GGA GTT CAG CAA TGT GCT AAA AAA TGT TCT GAA 
EGEGS I HVLETIKGGVQQCAKKCSE 
1546 
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CGC GCC CCA GAC AAA TGT CGC TCT TTC AAT TTT GAT AAA CAA GCT GGT AAT TGT AAT TTA CTT TAT TTG GAT GGA 
RAPDKCRS FNFDKQAGNCNLLYLDG 
1621 

CAA GGG TCT TTA CGA CCA GAG CAA AAG ACA CAA TTC GAT TTA TAC GAT GTT CAT TGT TTG AGT GGA ACA TCT CAA 

QGSLRPEQKTQFDLYDVHCLSGTSQ 

1696 

CTT TTA GGA GAA AAT TCT AAA CAT TCT CCC TCT GCT TGT GTT GAC CCA GAA GGG GCT ATT TTT AGT CGT TTC CTC 
LLGENSKHSPSACVDPEGAI FSRFL 
1771 

TAC ACT CGT TGG GTA GCA AAT TCT CCC AAT CGT GAA ATT TCA AGT TTA CCA CTT TCC AAA TGT TTA AAT CTT TGT 

YTRWVANSPNREISSLPLSKCLNLC 

1846 

tcg gtt gga gga gaa caa tgt gag ggt gtt aat tac aat cgc cga aat ggt tct tgt caa tta ttt act tcc ctt 
svggeq'cegvnynrrngscqlftsl 

1921 

CTA TTA AAC TCT TCT CCA AAT TCT CAA CAA GAC AAA GAC GAA CAT GTT GAT TTT TAC AGA AAT ATT TGT AGA GTT 
LLNS S PNSQQDKDEHVDFYRNI CRV 
1996 

AAG GAA TCG AAA AGT GAT AGT GGG GCT GCT AAT GTA CCC AAA ACA CAA CAA GCA ACG GCT GCA CCT CCC CCT TCT 
KESKSDSGAANVPKTQQATAAPP PS 
2071 

GTT CAA TTA ACT ACT AAA CCT CCA CAA ATT CGT GAT TTA AAC AAC AAC AAT AAA ACA ACA CAC AAA GAA CCA AAT 

VQLTTKPPQIRDLNNNNKTTHKEPN 

2146 

ATT AAA CTT CCA CCA CAA TCA GCA AAA CCT ATA AAT GGA AAA ACT GGA AAG GAA CAA CTT CCT GTA GGG TCA AAA 

IKLPPQSAKPINGKTGKEQLPVGSK 

2221 

TCT TTT GGG GTT ACT AAT ACG CGT GAT GAT GGG GAG AAT TCA ATA ACT GGA ACT GCT CCT CCT CCT GTA GAT GGC 

SFGVTNTRDDGENSITGTAPPPVDG 

2296 

AAA TTA ATT ATT AAA CCT TCA CCA CAA GTT TCT ATT CCC TCC CCT GTA CTT ATT CCG GCA CAA GAA GTA CAT ACT 

KLIIKPSPQVSIPSPVLIPAQEVHT 

2371 

ATT TGT AAT TAT GAA GGA ATT AGT GTT CAA ATT AAA CAT TCT TCT CCA TTC TCT GGC GTT GTT TTT GTT CGA AAT 

ICNYEGISVQIKHSSPFSGVVFVRN 

2446 

AAA TAT GAT ACT TGC CGT GTG AAG TTG AAG GAA AGG ACA GCG TTG TTT TGG TTT TGG GGC TTC CAG CAA ATT TTG 

KYDTCRVKLKERTALFWFWGFQQIL 

2521 

GAA ATG AAG CCA ATT GCT TTA ATT AAT TCA CAA AAA CAT GGA AAA GGG AAT AAA ACA CAC GGA GAT ACT TTA CTT 
EMKPIALINSQKHGKGNK THGDTLL 
2596 

TCT ATT GAA GGT TCC AAA AAA CAA ATT GAA GGG GGT TCT TCA ACT GAA GAT ATT CAA TTA ATA AAT TCT CAA AAA 
S IEGSKKQI EGGSSTEDIQLINSQK 
2671 

GAC CTT AAA CGT TCA AGA AGA CAA TTA CAA AGA GAT TGT GGA TTA CAA GAT ATG GAC AAT GGA ACT TAC AAA ACT 

DLKRSRRQLQRDCGLQDMDNGTYKT 

2746 

GTT ATT GTT GTC CAA ACA AAT AAT TTG GGA ATT CCG GGA CTT GTT ACT TCT ATG GAC CAA CTT TAT GAG ATT TCC 
VIVVQTNNLG I PGLVTSMDQLYE I S 
2821 

TGT AAC TAT TCA AGT ATG TTG GGA GGC AAA GTC CAA ACA GCA GCT GCA TTA CGT GTT CAC GGT CCC CAA CCT TCA 
CNYS SMLGGKVQTAAALRVHGPQPS 
2896 

CTA ATC CAG CCT CGC GGC AAA ATA GAA TTG GGA AAT CCT GTT TTG ATG CAA ATG GGG CCT GTA CGT AGT GAA AGG 

LIQPRGKIELGNPVLMQMGPVRSER 

2971 

CAA AGT GGG GAA GGG CCT TTA ATT CAA GCT AAA TTG GGG GAT ATT CTT GAA TTA AAA TGG GAA ATT ATG GCA ATG 

QSGEGPLIQAKLGDILELKWEIMAM 

3046 

GAT GAA GAA TTG GAC TTT TTA GTT CGT GAT TGT TTT GCA GAG CCG GGA ACT TCT GGA AAT CAA GGG GAA AGA CTT 
DEELDFLVRDCFAEPGTSGNQGERL 
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3121 

CCT TTA ATT GAG AAT GGT TGT CCA ACA CCA GCA GTA GCA CAA AAA TTA ATT CCA AAT CCA ATA AAA GCA ATT AAT 
PLIENGCPTPAVAQKLI PNPIKAIN 
3196 

TCT GCA GTT AAA TTA ACT TAT TTA CAA GCA TTC AGA TTT GAC AGT TCT CCA GCT ATT AGA ATA ACT TGT CAT TTA 

SAVKLTYLQAFRFDSSPAIRITCHL 

3271 

GAA TTA TGT AAA GAA AAT TGT AAA TCG GTT AAT TGT AAA TTT AAT GAT GGA ATT AAA GAA TCG TGG GGC AGA AAA 
ELCKENCKSVNCK FNDGIKESWGRK 
3346 

CGC CGT TTT GCT ATT GAC AAT AAC ATT AAT AGG AAA AAT GAA GTT AAA GAA TTC GAA ACT CGC CGT TTT GTC GTT 

RRFAIDNNINRKNEVKEFETRRFVV 

3421 

CCC CGT TTT GCC CAA GCA ACA ACT TCT TTA GTT ATT GTA GAC CCT TTA CAA CAA CAA AAT TCT GTT ATA AAA ACA 

PRFAQATTSLVIVDPLQQQNSVIKT 

3496 

GAA CAA CAA CAA CAA CCA TTT ATT TCA CAT TCC TCA ATA TCT AAA CAA ATA TTT GAA AAT AAT AAA AAA GAA AAT 

EQQQQPFISHSSISKQIFENNKKEN 

3571 

AAT AAA AAT ATA ACA AAA ACA GCT AAA AAA TCC TCT TCT CTT TTT GAA GCT TTT ACT GAG GCT GCT GGT GGA AGG 
NKNITKTAKKSSS LFEA FTEAAGGR 
3646 

AAA ATT AAT TTA GAA TTA ACA ACA ACA AAT TCA GAA CAA CAA CAA CTT TGT TTA CAT AAA TGG ACA CTT GGG GGT 

KINLELTTTNSEQQQLCLHKWTLGG 

3721 

GTT TTT GGA ACT CTT TTA ACA TTA ATT GTT GTT CAA AGC GGG GTT GCT GCT AAA CAT TTA ATT AAT CGA TTT ATT 
VFGTLLTLIVVQSGVAAKHLINRF I 
3796 

GTT GGA AAA AGA ATT TAAaaaaaaaaaaaaaagtactagtcgacgcgtggcc 
V G K R I 
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NEMATODE PAN AND ZP RECEPTOR-LIKE SEQUENCES 



l 

GAG CAG AAG ATT TTG GTG GGT TTC GCG CGG GAG GTG GTC 
EQKI LVGFAREVV 
67 

TCC GCT TGT CTG AAT GCG TTC GAT ACG TTC GGC TTC GAA 

SACLNAFDTFGFE 

142 

TGC ATT TTG AAC ACG GAG GAC CGA TTG GAT CGG CCT GAC 

CILNTEDRLDRPD 

217 

TTG GAC AAC AAT TGC GCC GGA TGT GAG TGC CAT TGG CAT 

LDNNCAGCECHWH 

292 

CAA CAA TTC GCA ATT GCA GCA CAA TGT TAC GCA CCG TAC 

QQFAIAAQCYAPY 

367 

TCG GAC GAA TTG GAC CAC AGT TTT GAA GGG TTG GAG CTG 

SDELDHSFEGLEL 

442 

GTT ACG GCA AAC GAC TTC AAC TGC AAA TCG TTC ATG TAC 
VTANDFNCKS FMY 
517 

CGC TCG CGC CCT TTG GGC CGT GCC AAT TTG GCC GAA GTG 

RSRPLGRANLAEV 

592 

TCG TTT ACG CGA GTG CCG CAA ATG CTT TTG GTG GGC TTT 

SFTRVPQMLLVGF 

667 

ATG TGT TTG GAC CAA TGC ACA AGC CCT CCT CCT GAG ACG 
MCLDQCTS PPPET 
742 

TAC AAC GAG CA 
Y N E 



TCC GCC GAC TCA GTC CAC CGC TGT CTG 
SADSVHRCL 

TGC GAG TCG GTC ATG TAT TAC CCT GTG GAC GCG GAA 
CESVMYYPVDAE 

CTT TTC GTG GAC GAG CAC GAG GAC ACG GTC ATC TAC 
LFVDEHEDTVIY 

TTT GAC AAT TTC AAA ACA AGC GGC ATT TTG AAC GAC 
FDN'FKTSG I LND 

GTA ACG CAA TAC GTG GCG GTG GAA GGA CGC CAA TTG 
VTQYVAVEGRQL 

AGC GAA TGT GAA GAG TTG TGC ACG CAA CGG TTA AGT 
SECEELCTQRLS. 

AGT AAC TTG ACG CGC AGT TGC GTT TTG TCG GAC GAA 
SNLTRSCVLSDE 

CCG GGA TGG ACT TAT TTC GAG AGC CGC GGC GTT CCG 
PGWTYFESRGVP 

GCC TCT TTT GTG ATG GAA AAT GTG CCG TCA GTG ACA 
ASFVMENVPSVT 

GGA CAA AAC TTT GTG TGT AAA TCG GTG ATG TAC TAC 
GQNFVCKSVMYY 
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l 

cgactggagcacgaggacactgacatggactgaaggagtagaaaatttctgttgttcatttcttatcagactgtcccattcatcatcgtgaccactacca 
101 

gtattacttcaggacagtaatattcgggtaaatttcggctgctcaatcggtaggaccgctttaat ATG CAT CTT TCC AAC CAT GCC TCA 

MHLSNHAS 

190 

TCA CTT CTG CAT TAC TAT TCA CAT CTC ATC ATA ATT GCA TAC TTT TCT GTA TTT GCT TCA ATC GAA ATA CAA GAA 
SLLHYYSHLII IAYFSVFASIEIQE 
265 

ATT CCA TCA TAT CCA GCA TGT AGC AAT GGC GAA TCA CCT GTC TTT TTA CTC CAA CAC AAT GCT ACA GCA GGT AAT 
I PSYPACSNGES PVFLLQHNATAGN 
340 

GTT CTG AAG CGA GCT TCA ACT TCA CAT CTG GTC GAC TGC ACT GAC CTT TGT TCA GCT AAC GAT GAA TGT TTG GCG 

VLKRASTSHLVDCTDLCSANDECLA 

415 

ATA ACC TAT GAA GAT AAA GAA TGC AAA ATG TTG TCA AGC ATT GGA GAA TCG ACA GGA CAT TTA AAT GAT TAT GTA 

ITYEDKECKMLSSIGESTGHLNDYV 

490 

TTG CTG AGT AAA AAT TGT GCT AAA AGT GCG CGG ATC TGC TCA TCG CCA TTT CAA TTC GAT GTA CAC AGA CAA AAA 
LLSKNCAKSARI CSSPFQFDVHRQK 
565 

ATT TTG GTT GGG TTT GCT CGC GAG GTT GTG TCA GCT GAT TCA TTA TCG TTA TGT CTA TCA GCT TGC TTG AAT GCA 

ILVGFAREVVSADSLSLCLSACLNA 

640 

TTT GAT TCT TTC GGT TTT GAA TGT GAG TCG GTA ATG TAC TAT CCA GTT GAT TCA GAA TGC ATC CTA AAC ACC GAA 

FDSFGFECESVMYYPVDSECILNTE 

715 

GAT CGT CTG GAT CGA CCT GAC TTG TTT GGG GAT GAA TTA GAT GAT AAC GTC ATT TAT TTG GAT AAC AAC TGT GCT 

DRLDRPDLFGDELDDNVIYLDNNCA 

790 

GGA TCA CAG TGT TAT GCT CCA TAC ATA ACA CAA TAC ATT GCC GTC GCA AAT CGT CAG CTA GCT AAC GAG TTG GAC 

GSQCYAPYITQYIAVANRQLANELD 

865 

AGA CAA CTG ATC GCT GAT CGT GAA TCA TGC GAG TCG TTA TGT ACT CAG CGA CTG TCT ACA ACG ACA AAC GAT TTC 
RQ.LIADRESCESLCTQRLSTTTND F 
940 

AAC TGT AAA TCA TTT ATG CAT AAT CCG GAA ACT AAC GTT TGC ATA CTT TCT GAT GAA CGT TCT AAA CCA CTT GGT 
NCKS FMHNPETNVCI LSDERSKPLG 
1015 

CGA GGC AAT CTA GTG AAA GCT GAC GGT TTC ACA TAT TAT GAG AAG AAA TGT TTT GCA TCA CCA CGA ACA TGT CGC 
RGNLVKADGFTYYEKKCFAS PRTCR 
1090 

AAT GTA CCG TCG TTT GAG CGC ATA CCT CAG ATG ATA CTT GTT GGT TTT GCT GCA TTT GTT ATG GAA AAT GTA CCT 
NVPS FERI PQMILVGFAAFVMENVP 
1165 

TCA GTA ACG ATG TGC CTC GAT CAG TGC ACA AAT CCT CCA CCG GAA ACT GGA GAA AAT TTC GAA TGC AAA TCT GTG 

SVTMCLDQCTNPPPETGENFECKSV 

1240 

ATG TAT TAT TAT AAC GAA CAG GAA TGT ATT TTA AAC GCT GAA ACA CGA GAA AAT AAA TCG GAA TTG TTT ATA CCG 
MYYYNEQECILNAETRENKSELFI P 
1315 

GAG GGA GAA GAA TTC CAA GTC GAT TAT TTT GAT ATC ACT TGT CAT CTG CGC CCT GAA ACA TGT CCA AAT GGC ACA 
EGEEFQVDYFD ITCHLRPETCPNGT 
1390 

ACA TTA CAT ACT GTA CGT ACG GTT AAT GCA GCA CTC CCT GAA GGC GAA GGA TCG ATC CAT ATT TTG CAG TCA GCC 

TLHTVRTVNAALPEGEGSIHILQSA 

1465 

GGG AAT TCG GTT GCT GAT TGC ATG ACA AAA TGT TAC GAG ATG GCT CCC GAG AAA TGT CGC GCA TTC AAT TTT GAT 
GNSVADCMTKCYEMAPEKCRAFNFD 
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1540 

AAG CAG ACA TCT GAC TGT GAC CTG CTG TAC GTT GAT 

KQTSDCDLLYVD 

1615 

GAT CTC TAC GAC CTT CAT TGC CTA GAG CAG ACA AAA 

DLYDLHCLEQTK 

1690 

TAT TTG TAC AGT ATA TAT GAT GCA GTG CCA TCG CAA 
YLYS IYDAVPSQ 
1765 

TGC GCA TAT ACC GAG CGT TGC GAA GGT GTA AAT TAC 

CAYTERCEGVNY 

1840 

GAA GGA AAT GGA AAG CCA AGT GAT TTC ACG GAT TTT 

EGNGKPSDFTDF 

1915 

TAT AGC GCC GCA GCT AAT GTT CCC AAA CAT CTC CTT 

YSAAANVPKHLL 

1990 

GCT AAA TTA CAC ATT ATC TCA GCA AAA ACA AAG CCT 
AKLHI I S'AKTKP 
2065 

ACA ATA ACA GCG AAG TCG TCT TCA GCT TCC GGA AAA 

TITAKSSSASGK 

2140 

TCT TCA TCC GGA AGG CTT CCA GGG CCA GTA GTC CAA 

SSSGRLPGPVVQ 

2215 

GGC ATC AAA GTG CAG ATG GAG AAC CCC AAA GCC TTT 

GIKVQMENPKAF 

2290 

CGA GTA GAG GTT ACG GAT AGT GAA AGT GCA CCA CTA 

RVEVTDSESAPL 

2365 

GCT GAT GAA AAG GTT GCC GCA AGC GAA GCA AAT ATT 

ADEKVAASEANI 

2440 

GCT GAT GAA CTG CGC ATA AGA CGA CAA GCT TTA GAG 

ADELRIRRQALE 

2515 

ACT TAT AAA TCA ACG GTG GTT GTA CAA ACA AAT AAC 
TYKS TVVVQTNN 
2590 

TTT GAA GTG AGC TGT GAT TAT AGT TCA ATG CTT GGT 

FEVSCDYSSMLG 

2665 

CCC GAA GCA TCT CTT ATT CAA CCC CGA GGA AAA ATC 

PEASLIQPRGKI 

2740 

CAA GGA GAA CCT GTC CTA CAA GCA AAA CTA GGT GAC 
QGEPVLQAKLGD 



GGG AAG ACA ACC TTA CGA CCA GCA GTC CAC TCG GGC ATT 
GKTTLRPAVHSG I 

GTT TGC GCT CAG AAA AAC AAC GTA ACA CGA TTT TCG AGA 
VCAQKNNVTRFSR 

TTC TAC GAA GCA ACT GCC CTC ACA AAT TGT CTT AAT CTT 
FYEATALTNCLNL 

AAC AGA AGG AAT GGT CGT TGT GAA TTA TTT GAT AAG GTC 
NRRNGRCELFDKV 

TAC AAA AAT CTT TGT CTG GTG GAA GAA GTA GAA TCA GAA 
YKNLCLV EEVESE 

CCG AAT GTT TCA CAT TCT GCA GTT ACT CAG AAA CAA GAA 
PNVSHSAVTQKQE 

TTC CTA CGC GAA CAG GAA GCA CAG CGA CGA GCT CCA GAA 
FLREQEAQRRAPE 

GTA AGT GGT GAA GCA GGA TCA TCA ACT ACA TTC AGC ATT 
VSGEAGS.STTFSI 

ATT GCT CCA AAT GCA GTG CAA ACA GTT TGC AAT TAT GAA 
IAPNAVQTVCNYE 

TCG GGA GTG ATA TTT GTT AAA AAT AGG TAT GAA ACC TGT 
SGVI FVKNRYETC 

GTA ATT GGT TTA CCA CCG AAT TTT GGT TCA AAA ATG GTA 
VIGLPPNFGSKMV 

CAA CCA GAA ATA TCC GGA GGC GAC AAA CTG GAT AAA CCC 
QPEISGGDKLDKP 

CTA CAC AGA GAT TGC GGA ATC CAG GAT ATG AAC AAT GGT 
LHRDCGI QDMNNG 

TTG GGT ATA CCT GGA CTG GTA ACT TCC ATG GAT CAG ATT 
LGI PGLVTSMDQI 

GGA AAA GTT ACT GCT GGT GCC AAT CTC ACA ATT GAT GGT 
GKVTAGANLT I DG 

GAA CTT GGT AAC CCG GTG CTT ATG CAG ATG TTG AGT GGA 
ELGNPVLMQMLSG 

ATT CTG CAG CTA CGA TGG GAA ATC ATG GCG ATG 
ILQLRWEIMAM 
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1000 
i 



Query 



Sinilar doaaip architectures 



PflNJIP 



NP_524831 

Drosophila melanog 
no mtchanoreceptor 

NP.505B75 

Catnorhabditis el« 
F47G9.3.P 

NP.502253 
Cacnorhabdiiis *le 
LET-653 mucin Uk* 

NP.502699 

Caenorh«bditi* «l* 
F52B11.3.P 

NP.501670 

Catnorhubditi* dt 
F38E11.4.P 

NP.502252 

Catnorhabditis «le 
C29E6.4.P 

NP_491706 

Catnorhabditis «le 
C34G6.B.P 

ARB52479 

Catnorhabditis «!• 
Hypoih*iical prot* 




AAK09434 

Drosophila rotlanog 
no-r*«chanor«cftpior 
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SEQ ID NO10 MVTKIPTFFLLFIFPFLPTFLTTKCQA YSIPLISBCNSBEAPVFU^PJrcSSIAaTEPLRTVPVTOGFI^CABLCSAANNCVAVKFSI^ 126 

SEQ_ID_NO_ll 
SEQ~ID_NO_16 
SEQ ID_NO_17 
SEQ_ID_NO_45 
SEQ ID_NO_3 



MKVFAVLALVVASVLA DTLFSVTLCFFBTQTIFVLQHN- 

MKVFAVLALVVASVLA DTLP SVTLCPPBTQTIPVLQHN- 

MKVFAWALLAVSALA DTLFSVTICPPBTQTIFVLQHH- 

MSKSOLHLVAYILLIFLISTNIA SKISQVPLCNXDTSPVFTLQHN- 

SBQ ID NO_12 MHLSNHAS SLLHYYSHLI I IATPSVPAS IB IQE I F SYPACSNGBSF VFLLQHN- 
ruler 1 10 20 30 40 50 



• TTVO - ARIRTIPTSH - LABCSDHCSASLDCQGVBFK -DGSCAVFRAOSB KATAQS QLLTKTCVKSD 

■TTVa-ARIRTIPTSH-LABCSDHCSASLDCQOVBFX-DGSCAVFRAaSB KATAQS QLLTKTCVKSD 

• STVQ - ARIRTIPTSH - LABCSDHCAASLDCQGVBPX -DOS CAVFRAGSE KATXGS QLLTKSCVKSD 

• STNG - 1 LARS LPQPO - LIDCSKHCSSS8DCIGVBYW - QOICRVI SQDKTS IYTPTDBTS ILLTKSCVKSD 

■ATAGNVLKR- -ASTSHLVDCTDLCSANDECLAITYE-DKECKMLSSiaBS- -TGHLNDY VLLSKNCAKSA 

60 70 80 90 100 110 120 130 



101 
101 
101 
112 
118 



. : 



• | | ***] • m *| | ****** B • | | • ************* > ** * .* * *.*.**** 



SEQ_ID_NO_10 KICSSPFHPT)VHEQJCILVQPARBWSABSIHQCLTACIJ)AVDTFaFB SQCHAPYVTQYVAVB 232 

SEQ ID_NO_ll BQKILVGFAKBVVSADSVBRCLSACZiMATDTPGPBCBSVHYYPVDABCILNTBDR 118 

SEQ_ID_NO_ 16 RVCQSPP^FDLFBQRILVGFARBVVPAANIQICMAACLNAFDTFGFBCBSAHFYPVDQBCILNTB^ YNYQKTTFSKSQCYPPYITQYIAVB 224 

S SBQ_ID^KO*3 RICSSPFBTOraQKILVGFARBW - -^- — -^QCYAPYITQYIAVB 218 

SBQ_ID_NO_12 RICSSPPQPDVHRQKILVGPARBVVSADSLSLCLSACLNAFDSFGFBCESVMYYFVDSECILNTBDRIiDR^ - - SQCYAPYITQYIAVA 224 

ruler 140 150 160 170 180 190 200 210 220 230 240 250 260 



..* : j***: i *j #*. *) | | ***.**|** s . *. * *,****,**♦*, 

SEQ_ID_NO_10 GKQIJLBBLDHNFBGMBLTBCBQLCNQRLSVSANDPNC?KAFMYN^ 

SEQ_ID_NO_ll GRQLSDBXiDHSPEGLBLSBCBBLCITQRLSVTANDFNCXSFMYSNLTRSCVLSDBRSRFXiGRJ 
SEQ_ID_NO_16 GKQLKHBLDR- IINVDLDSCQALCTQRLSISSNDFNCKSFMyNNKTRTCIIArBRSKPLGRADLIA' 
SEQ_ID_NO_17 GKQLKHBLDR- IINVDIJ)SCQAIjCTQRLSIS8NDP1ICK8FMYNNKTRTCIIJU)BRSKPL<3RADLIA' 
SEQ_ID_NO_45 GKQIJCHBIiDR-IIHVDIJISCQALCrQRLBISSHDPNCKSFM^ 

SEQ_ID_NO_12 NRQIAHBLDR-QLIADRBSOTSUnrQRLSTTTHDFNOT 

ruler 270 280 290 300 310 320 330 



'ASPRTCRHVPSPTRVFQMLLVaFASFVMBHVFSVTMCLDQCTHFFFBTGQSP 362 

SROVPSPTRVFQMLLVaFASFVMENVPSVTMCLDQCTSPPPETGQNF 239 

'ASPNTCRKVTSFKRVPQMILVaFAAFVMElIVPSVTMCIJJQCTNPPPBTaDGF 353 

'ASPHTCRNVFSPKRVFQMILVGFAAFVMBNVFSVTHCLDQCTNPFPBTGDGP 353 

'ASPOTCRKVTSPKRVPQMILVaFAAFVMKNVPSVTMCLDQCTNPPPBTGEGF 352 

'ASPRTCRQTPSFNRVPQMILVGFAAFVMENVFSVTMCLDQCTNPPFETGBKF 348 

'ASPRTCRNVFSPBRIPQMXLVGFAAFVMENVFSVTMCLDQCTNPPPBTGBNF 353 

340 350 360 370 380 390 



SEQ_ID_NO_10 VCKSVMYYYNEQBCII<NABSRHSKFDLFIFBBDDPVVDYFDINCRLBQBQCIDGRTPQLVRTIN^ 491 

SBQ_ID_NO_ll VCKSVMYYYNB 250 

SBQ_ID_NO_16 VOTBVMYYYNBQECILNSBTRBSKPBLFIFBGBBPLVDYPDITCmiKQEKCPTGQ 483 

SBQ_ID_NO_17 VCKSVHYYYNBQBCILNSBTRBSKPBLFIPBGBBFLVDYFDITCHLKQBKCPTOQHIJC^ 483 

SEQ ID NO 45 VCKSVHYYYNBQBCILNSBTRBSKPBLFIFBGBBFLVDYFDITCHLKQBXCFA 482 

SEQ_ID_NO_3 VCKSVMYYYNBQBCIIiHABTRBTKFDLFITBGDBFLVDYFDISCHXdSPBTCFKG 477 

SBQ_ID_NO_12 ECKSVMYYYNEQECILNAETRENKSBLFIPEQEEFQVDYFDITCHLRFETCPNGTTLBTV^ 482 

ruler 400 410 420 430 440 450 460 470 480 490 500 510 520 



SEQ_ID_NO_10 FDLYDVHCLSQ TSQLLGENSKESPS- -ACVDPBOAIFSRFLYTRWVANSPNRBISSLPLSKCLNLC 614 

SEQIDNOll - 250 

SEQIDNOIS VDLYDLHCLA VBNDCSANKDD ALFSRYI£TK-QROIPAICVYXWSLNSCLBVC-AGlIPTCAGANYintRIX2DCTLFDAIDDDABIN BHTDFYKNLCVTKBIDTGASAA 588 

SEQ_ID_NO_17 VDLYDLHCLAAMPL-VBNDCSANKDD- ALFSRYIBTK-QROIPAKVYKWSIJJSCLBVC-AGNPTCAGANYNRRIXmCTLFDAIDDDAEIN BHTDFYKNLCVTKBIDTGASAA 592 

SEQ_ID_NO_45 VDLYDLHCLAALPL-VBNDCSANXDD ALF SRYLHTK - QRGIPAKSYXW8LHSCLBVC - AGNPTCAGANYNRRIXIDCSLFDAIDBDABVN BHTDFYKKLCVTKEVDTGASAA 591 

SBQ_ID_NO_3 FDLVDLQCLS TKKDCSTKXNDIN PVKYLYSHFVKYLYSQ-QPOIPTTCTBJCVIGISKCLDLC-TDSBROTQL^^ SBHIDPYQNLCSTKBNSAGVSSA 592 

SBQ_ID HO_12 IDLYDLHCLE QTKVCAQKNN VTRFSRYLY8I - YDAVPSQFYBATALTNCLNIjC - AYTBRCBGVNYNRRNGRCELFDKVBGNOKPS DFTDFYKNLCLVBEVESEYSAA 587 

ruler 530 540 550 560 570 580 590 600 610 620 630 640 650 



seq id_no_10 anvpktqqat aafffsvqlttk- 

sbq"id_no_ii 

seq_id_no_16 anvpet khrvsgt- 

sbq id_no_17 anvpet khrvsgt- 

seq~id_no_45 anvpet khrvsgt- 

seq_id no_3 lnvpqssvipi sssqnisks- 

seq_id_no_12 anvpk hllpnvshs- 

ruler 660 670 680 690 700.. 



-PPQIRDLNN- 



-NNKTTHKBPNIKLPPQ- - -SAKPINGKTGKEQLPVGSKSFGVTNTRDDO 



-WBOKDS KSQLLATKKVKKPTIK- - -NTS- -HRRAPBS - 

-WBGKDS KSQLLATKKVKKPTIK- - -NTS- -HRRAPBS 

-WBOKDS KAQLLATKKVKKPTIK- - -HTB- -HRRAPBS 

- DVFAKKNLNKDG - - - NNQVNI YE P BKKYHFK - - -GSX- -NETSYBTGTVNXSHVBBVSBTLTN- 

-AVTQKQB AKLHI ISAKTKPFLR EQBAQRRAPBTXTAKSSSASGKVSGBAG- 

. .720 730 740 750 760 770 780 



691 
250 
634 
638 
637 
667 
651 



SEQ_ID_HO_10 
SEQ ID_NO_ll 
SEQ_ID_HO_16 
SEQ_ID_NO_17 
SEQ_ID_NO_45 
SEQ ID N03 
SEQ_ID_NO_12 
ruler 



ENS ITGTAPPP - 



•VDGIKLIIKFSPQVSIPSPVLIPAQBVBTICHYBaiSVQIKHSSPFSaVVFVRNKYDTCRVKUCB^ 



SGVBSGSLBKN- 



TVPIOP- - -PVEVKABAIQTICNYBaiKVQINNGEPFSaVIFVXOTCFDTCRVBTO 

TVPIGF---PVEVKAEAIQTICNYEGI3WQINNGEPFSaVIFVKNKFDTCRVBVANSNAATLVLa^ 

TVPLGP FVBVKABAIQTICNYBGIXYQINNGBFFSGVIPVXNKFDTCRVBVANSNAATLVL^ 

-IITAPFSIPKIPEGPLPVFILIFADQVQTICDYEGIKVQIKSPQSFTGVIFVXNHYBTCRVBVSNSDAATL^ 

- - SSTTFSIS S SGRLPGP - WQ IAPNA VQTVCNYBG IKVQHBNPKAFS GVI FVKNRYBTCRVBVTD S B SAP LVI GLPPNFG SKMVADBKVAAS B 
.820 830 840 850 860 870 880 890 900 910 



794 
250 
715 
719 
718 
771 
742 



SBQ_ZD_NO_10 KGNXTHG DTLLSIBGSKKQIBGGSST - BDIQLIHSQKDLKRSRRQLQ- RDCGLQDMDHGTYKTVIWQTNNLGIFGLVTSMDQLYBISCNYSSMLGGKVQT^ 918 

SEQ_ID_NO_ll - -- 250 

SEQ ID_NO_16 TGKNK TKKGB BTPIJCDBIBBPllQKRQAABFPJ>C&LVDIiLNOTYKSTWIQ/rNNLGIFG 823 

SEQ_ID_NO_17 TGKNK - TKKGB BTFUO)BIEBFRQKRQAABFRDCGLVDIJiNGTYKSTVVIQTNNLGIFGLVTSOT 82 7 

SBQ~ID_NO_45 TGKNK TKKGB ETPI*KEEIEEFRQKRQAAEFRDCGLVDLLNQTYKSTVVIQTNNM 82 6 

SEQ_ID_NO_3 TSSQN ITS NSGHKVVGRARRDTQEKSCGLTBIBNGXYKSTWIQTHNLGIP 873 

SBQ_ID_HO_12 ANIQP BISGG DKLDKPADBLRIRRQAIiBLH-RI>CGXQDMNNGTYKSTVVVQTNNLGIFGL 850 

ruler 920 930 940 950 960 970 980 990 1000 1010 1020 1030 1040 



SEQ_ID_NO_10 VLHQHGPVRSBRQSGBGPLIQAKLGDILBLKNBIMAHDBBLDFLVRDCFABFaTS - 

SEQ ID_NO_ll 

SEQ~ID_HO_16 VLMQLLN GDGTBQFLVQAKLGDXLSLRWBIMAMDDSLDPFVKNG 

SEQ_ID_HO_17 VLMQLLN GDGTBQFLVQAKLGDILBLRNBINAHDDBLDFFVKNG 

SEQ_ID_NO_45 VLMQLLN GDGTEQPLVQAKLGDILELRWEIMAMDDBLDFFVKNCHABP 

SBQ_ID_NO_3 VLMQMHA GTGDHQFILQAKLGDILEXJINEIMAMDEBLDFFVKDCHAEFQTG- 

SEQ_ID_NO_12 VLMQMLS GQGBPVLQAKLGDILQLRWBIMAM 

ruler 1050 1060 1070 1080 1090 



LIDGGCPTPAVAQKLIPGAIBIKSSAVKTTKHQAFRFDSSASIRVTCSVBICKGDCBFVBCALT 
LIDGGCPTFAVAQKLIFGAIBIKSSAVKTTKHQAFRFDSSASIRVTCEVBICKGDCBPVECALT 
QLIDGGCFTPAVAQKLIPaAIEVKSSAVKTTKMQAFRFDSTAS IRVTCBVE I CKGDCEAVECALT 
-AGGDBKLQLIEGGCPTPAVAQKLIPQPIKLQSSAVKIABI^}AFRFDSSSSVRITGNIBXCXGDC3UPATCDMH 



.1110. 



.1120. 



.1130. 



.1140 1150. 



.1160. 



.1170 



1045 
250 
949 
953 
952 
996 
881 



SBQ_ID_HO 10 
SEQ_ID_HO_ll 
SEQ_ID_NO_16 

SBQ_ID_HO_l 7 GGVKKSFGRKKRBVSHN- 

SEQ_ID_NO_3 OESKQS 

SBQ_ID_HO_12 

ruler 1180 1190. 



ATTSLVrVDPLQQQNSVIKTBQQQQFFISHSSXSKQIF 



TAKXSSSLFBAFTBAAGGRKXNLBLTTTNSBQQQLCLHK 



-X8BFBTHRYLIFRRSHATTS IVI IDPLQQVNBPV - 
- I BB FETNRYL I PRRSHATTS IVI ID PLQQVHB PV - 
- I BEPETNRYLIFRRSHATTSXVI IDPLQQVNBPV - 



OSICLNS 

- AMSRASTLDLLRBD - AHBVQMIEB OSICLNS 

- AMSRASTLDLLRBB - AHBVQVXBB GSICLNR 

- LMBKVS SLD LLABD P AKTLLK XKBTA - -HLNONLCMOK 



.1200 1210 1220 1230 1240 1250. 



.1270. 



.1280. 



.1290. 



.1300 



1175 
250 
1030 
1034 
1033 
1085 
881 



SEQ_ID_NO_10 WTLOGVFOTLLTLIVVQSGVAAKHLINRFIVGKRI- - 

SEQ_ID_NO_ll 

SEQ_ID_NO_16 VTVFAIFGTLAVLILGQTWIAHYAVRRFSSBKTA- - 
SEQ_ID_NO_17 VTVFAIFGTLAVLILGQTWIAHYAVRRFSSBKTA- - 
SBQ ID_NO_45 I TVF A I PGTLA VL I LGQ V IWAHYAVRRF S TEX T A - - 
SBQ_ID_NO_3 ITLFSVPGVLLSLIVVQAIVVTNYIFXRVMSSRKXTN 

SBQ__ID_HO_12 

ruler 1310 1320 1330 



1210 
250 
1065 
1069 
1068 
1122 
881 



Figure 



7 BEST AVAILABLE COP^ 



Matter No.: 12557-015001 Page 14 of 14 

Applicant(s): Michele Coutu Hresko et al. 

NEMATODE PAN AND ZP RECEPTOR-LIKE SEQUENCES 



SBQ_ID_NO_4 6 -MSPRVIFLLLGSFLTAQAVFECSSHETTAFVRIPRARLDQTPW 129 
ruler 1 lof.?""?20^?f 30 40 50 ?.60 70 80 90. f . t?T?iOO?tTf^TiaoffT?^*i2 offt?fT?i3 ^ 



SBQ_ID_NO_46 FERARNTQIJSGFViatSVTVKNREHCLSACIJUJKBFVCXSVN^ IDTTDMNCLSRQNRCaFSGGNLVFVKTTHTBXRYYDHTQSVEAQESYCLQlCCLDSUITPCR 258 

S2Q ID N018 FBRARNTQLBG7VKKSVTVBNRKHCLSACLKEKBFVCKSVNFHYOT - IDYTDHNCLSRQNRCaFSQaNLVFVKTTNTBZRYYDHTQSVBAQBSYCX^KCLDSIlITPCR 257 

SEQ_ID_NO_4 FERTRin^LEGFVKXSLQVTSREECLSTCLKESEFVCRSVNYHYBN^ 260 

ruler 140 150 160 170 180 190 200 210 220 230 240 250 260 



***, .*.***********,*******, * . ******.**•***•*********•******. *** *•,*# ************************ »,*.*******, t * : * ,.*.***** 

SSQ_ID_NO_4 6 
SEQ_ID_NO_18 

SEQ_ID_NO_4 SVEYSPSBKNCIVSDET)TY9RADQQaRVNNKDYYBPVCVAADLSSST^ 

ruler 270 280 290 300 310 320 330 340 350 360 370 380 



388 
387 
363 



.390 



SBQ_ZD_NO_4 6 C^LCTNEaiRFIVNTKSPYTOAIYAAERFSTCSQWKNAKQISITFPP 518 

SEQ_ID_NO_18 GMAWrnraQIRFXVimCBPrraAIYAAERPSTC^^ 517 

8EQ_ID_NO_4 383 

ruler 400 410 420 430 440 450 460 470 480 490 500 510 520 



SEQ_ID_NO_4 6 TAELQILNQLBEPLHKASIGDPLLLVITSEQAGPHNHMVTECT^ IHFCAGPNGCPVSNCLDSGTNELFMSHQ 648 

SEQ_ID_NO_18 SAELQILNQLBBPLHKASIGTPIJJjVITSBQAGPHNKMVTECTATRVGaFaDTVTPTLI IM7CAGPNGCP V3NCLD 3GTNBLFMSHG 647 

SEQ_ID_NO_4 TIIDNGCPRYFALVQFVLQDFDKNRLK8KMKAFRIJ)GSYDIQI^ 457 

ruler 530 540 550 560 570 580 590 600 610 620 630 640 650 



•******.« **.* *.* 



•:* :. 



SEQ_ID_NO_4 6 RKKRS -VDLEAGET-EERLSAIIRVFAKGBDEEEIEMG- -NNTLMTSLAES TDLLC IAEP FFVS SW S LSVLCFAL3AI IA IWGCHAIiHAKPTKQVAA 742 

SEQ_ID_NO_18 RKKRS -ADLBAOBT-BEKLSAIIRVFAKOBDEBBMEKA- -NNTMKTSMSDS TBLLCIAKPPFVSSWSLSVLCFALSAIIAIWGCHSLHSKPVKQVAA 741 

SEQ_ID_NO_4 RO^SrVDE^NTTTSABTLSAIIRVIJ^GBEELBVBBFYRNDTNPKYDSRKNIS 557 

ruler 660 670 680 690 700 710 720 730 740 750.. 



Figure 8 
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